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(54) Abstract Title 

Archiving network usage data within a predetermined period 

(57) A method for archiving data within a predetermined time interval, using an archiver connected to a data 
collector 18 having a list 26 of available data sets of an Internet Usage Manager 14, includes the steps of 
obtaining a list of available data sets from the data collector, reading data for a data set from the list, 
associating read data to the data set, converting all the read data to a predefined archive format once data 
from each available data set in the list is read, saving the converted data to a file 32, and waiting for a 
predetermined time to repeat the foregoing steps. 
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1 Inventors: Pierre Perinet; Eric Peterson 

2 METHOD AND SYSTEM FOR ARCHIVING DATA WITHIN A 

3 PREDETERMINED TIME INTERVAL 

4 The present invention generally relates to an improved method 

5 and system for archiving data. More specifically, it relates to an improved 

6 method and system for archiving data using an archiver connected to a data 

7 collector having a hst of available data sets of an Internet Usage Manager 

8 within a predetermined time interval. 

9 BACKGROUND OF THE INVENTIVE ART 

10 Because Internet servers can provide valuable information about 

1 1 their users, currently many software applications are designed to collect such 

12 usage data. The data includes important information relating to items, such as 

13 usage measxire, geogmphical information, and user service requests. For 

14 example, the data can provide valuable information for a business manager in 
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1 trying to understand the usage behavior of users, identify needs for new 

2 services, managing the pricing of subscription plans, and determining profit 

3 margin. All this information can provide managers with valuable marketing 

4 tools. 

5 Software applications for collecting usage data are generally 

6 known as Internet Usage Managers ("lUM"). The present invention is 

7 designed specifically for an lUM disclosed in a commonly owned U.S. Patent 

8 application filed on April 12, 2000 entitled "Internet Usage Analysis System 

9 And Method" bearing Serial No. O^/c^S'.Jt'f by Lee Rhodes, 

10 assigned to HP company. This patent appUcation is specifically incorporated 

1 1 by reference herein. 

j2 An lUM typically includes a data coUector for saving any user 

13 data relating to the server. The data collector is set up to collect data 

14 continuously. As a result, the usage data is not divided into various time 

1 5 intervals, which could be extremely valuable information relating to the users. 

16 For example, managers can track the behavior and traffic of the server in a 

17 particular month, week or day. However, smce the data collector collects 

18 information continuously, this type of desired information is difficult to 

19 achieve using the data collector of the lUM. As a result, there is need for an 

20 improved lUM that collects data within a predetermined time interval selected 

21 by users. More specificaUy, an archiver connected to the lUM that can 

22 automatically archive data collected by the data collector according to a 

23 predetermined time interval is needed- 



BRIEF SUMMARY OF THE INVENTION 
The present invention is directed to an improved method and 
26 system for archiving data. More specifically, it relates to an improved method 
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1 and system for archiving data using an archiver connected to a data collector 

2 having a list of available data sets of an Internet Usage Manager within a 

3 predetermined time interval. 

4 The present invention provides a method that includes the steps 

5 of obtaining a list of available data sets from the data collector, reading data for 

6 a data set from the list, associating read data to the data set, converting all the 

7 read data to a predefined archive fonnat once data from each available data set 

8 in the list is read, saving the converted data to a fUe and waiting for a 

9 predetermined time to repeat the foregoing'steps. 

10 The present invention also provides a system that includes an 

11 Internet Usage Manager system for managing server usage data, a data 

1 2 collector of the lUM for collecting the usage data, a list of data sets defined by 

13 at least one category, an archiver for archiving the usage data within a 

14 predetermined time interval, and a client for an user interface for selecting and 

1 5 displaying the usage data. 



16 DESCRIPTION OF THE DRAWINGS 

17 FIGURE 1 is an architectural diagram of an implementation 

1 8 using the presCTt invention; 

19 FIG. 2 is a flow chart illustrating the preferred functionality of a 

20 method of the present invention; 

2 1 FIG. 3 is an exemplary page displayed on the client using the data 

22 archived within a predetermined time interval; and, 

23 FIG. 4 is anotfier exemplary page displayed on the client using 

24 the data archived within a predetermined time interval. 
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GLOSSARY OF TERMS AND ACRONYMS 

2 nie following teniis and acronyms are used throughout the 

3 detailed description: 

4 Archiver. A computer for archiving data collected by the data 

5 collector of an Internet Usage Manager system within a predetermined time 

6 interval. 

7 Archive. A single file containing one or more separate files plus 

8 information into a format, such as XML or Binary, that allows them to be 

9 extracted by a suitable program. 

jQ Binary data. A file format for digital data encoded as a sequence 

11 of bits but not consisting of a sequence of printable characters (text). The term 

12 is often used for executable machine code. 

13 Common Object Request Broker Architecture ("CORBA"). An 

14 Object Management Group ("OMG") specification which provides the standard 

1 5 interface definition between OMG-compliant objects. 

Data Collector. A module in the Internet Manger Usage system 

17 that continuously collect usage data of the server. 

Extensible Markup Language ("XML"). An initiative from the 
W3C defining an "extremely simple" dialect of SGML suitable for use on the 

20 World-Wide Web. 

21 Hyperlink. A navigational link firom one document to another, 
ftom one portion (or component) of a document to another, or to a Web 
resource, such as a Java applet Typically, a hyperlink is displayed as a 

24 highlighted word or phrase that can be selected by clicking on it using a mouse 

25 to jump to the associated document or document portion or to retrieve a 

26 particular resource. 
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1 HTML (HyperText Markup Language). A standard coding 

2 convention and set of codes for attaching presentation and linking attributes to 

3 infonnational content within documents. (HTML 2.0 is currently the primary 

4 standard used for generating Web documents,) During a dociunent authoring 

5 stage, the HTML codes (referred to as "tags") are embedded within the 

6 infonnational content of the document When the Web document (or HTML 

7 document) is subsequently transferred fiora a Web server to a browser, the 

8 codes are interpreted by the browser and used to display the dociunent 

9 Additionally in specifying how the Web browser is to display the document, 

10 HTML tags can be used to create lioks to other Web documents (commonly 

11 referred to as "hyperlinks"). For more information on HTML, see Ian S. 

12 Graham, The HTML Source Book, John Wiley and Sons, Inc., 1995 (ISBN 

13 0471-11894-4). 

14 Hyper Text Transport Protocol ("HTTP"), The standard World 

1 5 Wide Web client-server protocol used for the exchange of information (such as 

16 HTML documents, and client requests for such documents) between a browser 

17 and a Web server. HTTP includes a number of different types of requests, 

18 which can be sent from the client to the server to request different types of 

19 sender actions. For example, a "GET" request, which has the format GET 

20 <URL>, causes the server to return the dociunent or file located at the specified 

21 URL. 

22 Internet. A collection of interconnected or disconnected networks 

23 (public and/or private) that are linked together by a set of standard protocols 

24 (such as TCP/IP and HTTP) to fomi a global, distributed network. (While this 

25 term is intended to refer to what is now commonly known as the Intemet, it is 

26 also intended to encompass variations which may be made in the future, 

27 including changes and additions to existing standard protocols). 
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1 Internet Usage Manager ("lUM"). A computer implemented 

2 system for managing usage data of the server. 

3 Object Management Group ("OMG"). A consortium aimed at 

4 setting standards in object-oriented programming. 

5 Object-Oriented Programming. The use of a class of 

6 programming languages and techniques based on the concept of an "object" 

7 which is a data structure (abstract data type) encapsulated with a set of routines 

8 that operates on the data. Operations on the data can only be performed via the 

9 routine sets. These routine sets are common to all objects that are instances of 

10 a particular "class". As a result, the interface to objects is well defined, and 

11 aUows the code implementing the routine sets to be changed so long as the 

12 interface remains the same. 

13 standard GeneraUzed Markup Language ("SGML"). A generic 

14 markup language for representing documents. SGML is an International 

15 Standard that describes the relationship between a document's content and its 

16 structure. SGML allows document-based information to be shared and re-used 

17 across applications and computer platforms in an open, vendor-neutral format. 

1 8 URL (Uniform Resource Locator). A unique address which fully 

19 specifies the location of a file or other resource on the Internet or a networL 

20 The general format of a URL is protocol: //machine address:port/path/filename. 

21 Usage Data. Data collected by the lUM relating, among other 

22 things, to information on users, sessions and usage. 

23 World Wide Web ("Web"). Used herein to refer generally to both 

24 (i) a distributed collection of interlinked, user-viewable hypertext documents 

25 (commonly referred to as Web documents or Web pages) that are accessible via 

26 the Internet, and (ii) the client and server software components which provide 

27 user access to such documents using standardized Internet protocols. Currently, 
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1 the primary standard protocol for allowing applications to locate and acquire 

2 Web documents is HTTP, and the Web pages are encoded using HTML. 

3 However, the terms "Web" and "World Wide Web" are intended to encompass 

4 future markup languages and transport protocols which may be used in place of 

5 (or in addition to) HTML and HTTP. 

6 DETAILED DESCRIPTION 

7 Broadly stated, the present invention is directed to an improved 

8 method and sj^tem for archiving data. The method and system provide an 

9 archiver for archiving data collected by a data collector of an lUM within a 

10 predetermined time interval. Because the archiver can be set up separately to 

1 1 collect data within a predetermined time interval, usage data can be displayed 

12 to users according to years, months, weeks, days, hours, minutes and seconds. 

13 Usage information provided in such a context can greatly assist managers in 

1 4 understanding the behavior and usage of users. 

15 An architectural diagram of an implementation using the present 

16 invention with an lUM is shown in FIG. 1, and indicated generally at 10. An 

17 archiver 12 is connected between an lUM 14 and a client 16* The lUM 14 is a 

18 computer for managing server statistical usage data, and includes a data 

19 collector 18 for collecting usage data 20 of users xising a HTTP server 22 with 

20 specific server configurations 24. It should be noted that the preferred 

21 implementation of the present invention is for use with Internet servers (e.g. 

22 HTTP servers). However, other servers such as intranet or network servers can 

23 be used These other implementations with the use of other types of servers are 

24 within the scope of the present invention* A list of a plurality of data sets 26 

25 defined by at least one category is also included with the lUM. Data collected 

26 by the data collector is grouped according to each data set in the lists. 
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The lUM 14 is preferably linked to the archiver 12 and the cUent 
16 via a CORBA connection 28, 28'. Using the settings defined in the 
configuration file 30. the archiver 12 automatically archives data firom the data 
collector within a piedetennined time interval. The archived data 32 is then 
saved locaUy. Because the archiver 12 also services the client 16, a HTTP 
server 34 is preferably used for storing the archived data 32 and servicing the 
7 client 

The client 16, on the other hand, is a user interface for displaying 
the usage data collected by the archiver 12 or the data collector 18 of the lUM 
14. If a user desires usage data within a predetermined time interval, the client 
gathers the needed data from the archive. However, the client 16 can also 
access the data collector 18. In fact, the client can access data 36 saved locally 
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13 onthecUent 

Although it is shown that the archiver 12, the lUM 14 and the 
client 16 are located on different computers, they can be combined together in 
any number of computers depending on the preferred implementation, to fact, 
as known by those of ordinary skilled, the network topology of the present 
invention can be implemented in various ways, and these other 

19 implementations are within the scope of the present invention. 

20 Turning to an important aspect of the preferred embodiment of 

21 the present invention, a flow chart of the preferred functionality of a 

22 configuration method is shown in HG. 2. and indicated generally at 50. At the 

23 start of the archiver (Block 52). it first obtains configuration settings fiom the 

24 configuration file (Block 54). which preferably includes a URL of the data 

25 collector, a maximum nmnber of files allowed and a predetermined time 

26 interval for coUecting the data. Other infomiation and settings relating to the 
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1 archiver can also be included in the configuration file. Next, the archiver 

2 obtains a list of available data sets firom the data collector (Block 56). 

3 The archiver will archive data associated with each data set in the 

4 list More specifically, the archiver reads data for the first data set fi*om the list 

5 (block 58). The data will be associated with this first data set (block 60). It is 

6 next determined whether there is a next data set in the list (block 62). If there 

7 is, the archiver will continue to read the next data set firom the list (block 64) 

8 and an association of the data will similarly be made with this next data set 

9 (Block 60). The archiver continues reading the data for each data set firom the 

10 list (block 64) until a next data set is no longer available in the list (block 62). 

1 1 Once all the data has been read for every data set in the list (block 62), the read 

12 data is converted to a predefined archive format (block 66). The preferred 

13 archive format is preferably either binary or XML. However, any archive 

1 4 format can be used, and they are within the scope of the present invention. 

15 Before the converted data is saved locally on the archiver, it is 

16 first determined whether there is already the maximum number of files allowed 

17 (Block 68). A maximum number of files allowed is preferably provided for in 

18 the configuration file to avoid having endless megabytes of files that 

19 unnecessarily take up memory of the archiver. Although it is not essential to 

20 the method, it is preferred that some type of maintenance for the archiver is 

2 1 included with the present method. If the maximum nxmiber of files allowed are 

22 already in the system (block 68), the converted data will be saved by replacing 

23 it in the oldest file allowed in the system (block 70). On the other hand, if the 

24 maximum number of files allowed are not ahready in the system (block 68), the 

25 converted data will then be saved onto a new file (block 72). Once the 

26 converted data has been saved, the archiver will log the date and time to 

27 indicate when the data was collected and archived &om the data collector of the 

9 



1 lUM (block 74). The archiever will then wait for a predetermined time from 

2 the last logged date and time (block 76) to restart the process from the step of 



obtaining a list of available data sets from the data collector (block 56). 



3 

4 Exemplary pages displayed on the client using the data archived 

5 within a predeteraiined time interval by the archiver is shown in FIGS. 3 and 4. 

6 Because the archiver collects data according to a predetermined time interval, 

7 such as once a day, various histograms, graphs and charts can be viewed within 

8 a time interval set by users. Near the top of the screen, users can choose 

9 specific parameters relating to categories of model type, measure, time interval, 

1 0 geographical location, user plan or user service. In this example, the data set is 

1 1 defined as "distribution" for model type, "usage" for measure, "last 30 days" for 

12 time interval, "all" for geographical location, "bronze" for user plan and "web" 

13 for user service. 

j4 A single data set with these specific parameters is defined in the 

15 Ust. For example, if the category time interval is changed to last week with the 

16 other categories remaining the same, another data set is defined for these 

17 parameters. Thus, in the list of the lUM, the preferred embodiment has 

18 thousands of data sets. However, as shown, both examples in FIGS. 3 and 4 

19 define the same data set, and the only difference is the type of report the user 

20 selected. More specifically, a cumulative % users for FIG. 3 and a historical 

21 summary for FIG. 4. 

22 Because the archiver automatically archives the data within a 

23 predetermined time interval, the client is able to view data according to various 

24 time interval, which is not available through the data collector of the lUM. As 

25 a result, managers can view usage data that are more relevant to marketing. 

26 The archiver provides an additional dimension to lUM systems for providing 

27 an improved and vahiable marketing tool. 

10 



1 From the foregoing description, it should be understood that an 

2 improved method and system for archiving data for use with an lUM has been 

3 shown and described, which has many desirable attributes and advantages. The 

4 method and system provide users with data according to a predetermined time 

5 interval on the lUM. As a result, a time interval category is added to the lUM 

6 systems, which provides important data information. Thus, the archiver 

7 provides an additional dimension for data archiving in lUM systems. 

8 While various embodiments of the present invention have been 

9 shown and described, it should be understood that other modifications, 

10 substitutions and alternatives are apparent to one of ordinary skill in the art. 

11 Such modifications, substitutions and alternatives can be made without 

12 dq)arting fi-om the spirit and scope of die invention, which should be 

1 3 determined fi^om the appended claims. 

14 Various features of the invention are set forth in the appended 

15 claims. 
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WHAT IS CLAIMED IS: 



J 1. A metiiod for archiving data using an archiver connected 

2 to a data collector having a Ust of available data sets of an Internet Usage 

3 Manager within a predetermined time interval, comprising the steps of: 

4 obtaining a list of available data sets from the data collector, 

5 reading data for a data set from tiie list; 

6 associating read data to the data set; 

7 converting all the read data to a predefined archive format once 

8 data from each available data set in the list is read; 

9 saving the converted data to a file; and, 

10 waiting for a predetermined time to repeat the foregoing steps. 

1 2. The method according to claim 1 wherein said archiver is 

2 configured for use with an Internet Usage Manager system. 

1 3. The method according to claim 1 wherein said data set 

2 includes any one from the category of model type, measure, time interval, 

3 geographical location, user plan or user service. 

1 4. The method according to claim 1 wherem the converted 

2 data is in Binary Language or Extensible Maricup Language. 

3 5. The method according to claim 1 wherein prior to the step 

4 of obtaining a list of available data sets further comprises the step of obtaining 

5 configuration settings from a configuration file for the archiver. 

1 6. The method according to claim 5 wherein said 

2 configuration file comprises an uniform resource locator of the data collector, a 
maximum number of files allowed, and a predetennined time interval for 
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4 collecting the data. 
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1 7. The method according to claim 1 wherein said step of 

2 reading data for a data set from the list further comprises the step of: 

3 reading a first data set from the list; 

4 associating read data to the first data set; 

5 determining whether there is a next data set available in the list; 

6 reading the next data set from the list when there is a next data set 

7 available in the list; and, 

8 converting the read data once there is no next data set available in 

9 the list 

1 8. The me&od according to claim 1 wherein said step of 

2 saving the converting data to a file further comprises the steps of: 

3 determining whetiier there is already a maximum number of files 

4 allowed; 

5 replacing the oldest file with the converted data if there is akeady 

6 a maximum number of files allowed; 

7 saving the converted data to a new file if there is not a maximmn 

8 number of files allowed. 

1 9. The method according to claim 1 wherein prior to said 

2 step of waiting for a predeteraiined time further comprising the step of logging 

3 the ciirrent date and time. 

1 10. A computer program product comprising a computer 

2 usable mediimi having computer readable program codes embodied in the 

3 medium that when executed causes a computer to: 

4 obtain a list of available data sets from the data collector, 

5 read data for a data set from the list; 

6 associate read data to the data set; 

13 



7 convert aU the read data to a predefined archive format once data 

8 for each available data set in the list is read; 

9 save die converting data to a file; and, 

jQ wait for a predetermined time to repeat the process. 

11. An archiver for archiving data within a predetermined 
time interval connected to an Internet Usage Manager system having a data 
collector for collecting available data for a list of data sets to: 

obtain a list of available data sets from the data coUector, 
read data for a data set from the list; 

6 associate read data to die data set; 

7 convert all the read data to a predefined archive format once data 

8 for each available data set in the list is read; 

9 save the converting data to a file; and, 

jQ wait for a predetennined time to repeat the process. 

1 12. A system for archiving data using an archiver connected to 

2 a data collector having a list of available data sets of an Internet Usage 

3 Manager within a predetennined time interval, comprising: 

4 an hitemet Usage Manager system for managing usage data of a 

5 server; 

6 a data collector of said Internet Usage Manager system for 

7 collecting the usage data; 

8 a list of data sets defined by at least one category; 

9 an archiver for archiving the usage data within a predetermined 

10 time interval; 

J J a client for an user interface for selecting and displaying the 

12 usage data. 
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